Physics of evolution: selection without fitness 
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Traditionally evolution is seen as a process where from a pool of possible variations of a popu- 
lation (e.g. biological species or industrial goods) a few variations get selected which survive and 
proliferate, whereas the others vanish. Survival probability is typically associated with the 'fitness' 
of a particular variation. In this paper we argue that the notion of fitness is an a posteriori concept, 
in the sense that one can assign higher fitness to species that survive but one can generally not 
derive or even measure fitness - or fitness landscapes - per se. For this reason we think that in 
a 'physical' theory of evolution such notions should be avoided. In this spirit, here we propose a 
random matrix model of evolution where selection mechanisms are encoded in the interaction ma- 
trices of species. We are able to recover some key facts of evolution dynamics, such as punctuated 
equilibrium, i.e. the existence of intrinsic large extinctions events, and, at the same time, periods 
of dramatic diversification, as known e.g. from fossil records. Further, we comment on two funda- 
mental technical problems of a 'physics of evolution', the non-closedness of its phase space and the 
problem of co-evolving boundary conditions, apparent in all systems subject to evolution. 

PACS numbers: 87.23. Ke, 87.10-e, 87.23. Cc, 02.10.Ox 

I. INTRODUCTION 

The quantitative science of evolution can look back on a series of models dealing with the evolution of biological 
systems and the dynamics of technological innovation. In this context maybe the most important ones are the models of 
Kauffman (see e.g. [1) on biological and Arthur [2J on technological evolution. A traditional way to treat dynamical 
systems of evolution is to use high dimensional catalytic network equations (3J H], the Lotka-Volterra equations 
and the Hypercycle [5] being famous examples. Recently linear versions of these catalytic network equations have 
been shown to demonstrate the influence of topological aspects of the underlying network topology on the resulting 
population dynamics [5] . There the existence of highly populated phases have been demonstrated to coincide with 
the appearance of closed directed feedback loops in the network, the so-called autocatalytic cycles. On the subject of 
extinction dynamics, which constitutes an important branch in understanding evolution, several specific models have 
been brought forward lately [7j [5j [TtJl [TT] . 

Of fundamental importance in all of the above models is the notion of fitness. Since the times of Darwin fitness is 
used as a method for selection, which is fundamental to evolution. Fitness of an entity is a function depending on a 
large number of parameters, describing both the state of the entity and the state of its environment. Due to the high 
dimensional parameter space one often talks about fitness landscapes. These landscapes may be rather complicated 
and hard to optimize. As useful as it may be for an intuitive feeling of how selection works, the notion of fitness is 
clearly an a posteriori concept. An entity is fit if it had survived and proliferated well, and unfit if it went extinct. 
The same is true for the concept of ecological niches. Space that never got occupied by entities will not count as a 
niche. These concepts could be seen as ambiguous, and anthropomorphic. Fitness and niches can not be measured in 
reality in much the same way as utility functions can usually not be measured in economics. 

If there exists something like a physics of evolution it would be necessary to phrase models without using an- 
thropomorphic concepts and, at the same time, provide a framework abstract enough to capture the universality 
behind e.g. technological and biological evolution in a unified way. This universality manifests itself in the fields of 
biological evolution, industrial innovation, socio-dynamics, economy, finance, opinion formation, ecological dynamics, 
e.g. food-webs, and history. In the following we would like to avoid these notions in reasoning about evolution all 
together. We propose a model able to explain several facts about evolutionary dynamics - such as the existence of 
intrinsic large extinctions events and booms of diversity of species over time (punctuated equilibrium), see Fig. [I] - 
based on three guiding principles: low resolution, parsimony and maximum ignorance. Resolution of detail is limited 
in many evolutionary time series. Practically one can often only resolve and understand the dynamics of coarse 
variables as, for example, the diversity of the system, i.e. how many different species arc actually evolving together. 
Parsimony (Occam's Razor) calls for simplistic models based on a minimum of resolvable explanatory parameters. 
Finally, maximum ignorance suggests to model interactions which are not known in detail on the basis of random 
matrices (tensors) as introduced by Wigner [13]. 

Generally speaking, evolution can be understood as an iterative three-step process. Step 1: a new thing comes 
into existence within a given environment, usually on the basis of existing things by means of substitution and 
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FIG. 1: Number of biological genera over time. Clearly visible are the mass extinctions marked by triangles, often followed by 
periods of massive diversification (punctuated equilibrium). Time runs from right to left. Data after [12J . 

combination processes. Environment is defined as the set of all existing things in the system. Step 2: The new thing 
has the chance to interact with the environment. As a result of this interaction the thing gets selected (survives) or 
de-selected (destroyed, suppressed). Step 3: In case the new thing gets selected within this environment it becomes 
part of the environment for other things, thereby changing the conditions for both - existing things and future new 
things yet to arrive. The new thing, if selected, has the chance to 'proliferate' in terms of substitution and combination 
processes with itself or other existing things. 

It is clear that such a process is far from being of Newtonian nature as usually dealt with in physics. The most 
obvious problem arises from the innovations happening in the course of the dynamics: the phase space of the system 
is not closed. Whenever a satisfactory set of variables is found that describes the history of the system well, at 
some point this set of variables becomes insufficient to describe states of the system in the future. For example, 
before television was invented history of mankind can be understood without television, whereas it gets increasingly 
impossible to understand the history of man after the invention without taking the secondary effects of television 
on society into account. The second major problem is that of co-evolving boundary conditions, which all evolution 
systems share. As the system evolves (new things come into being) the boundary conditions (interaction possibilities 
with others) constantly change. This makes such systems extremely hard to phrase in terms of differential equations. 
Since it is impossible to forecast the invention of entities and new variables associated with them, it seems reasonable, 
as a starting point, to model their respective interactions based on random interactions. 

As stated above new things usually come into existence through a process of (re-) combination or substitution of 
already existing things. Examples range from sexual reproduction to the assembly of an ipod from its components. 
Thus models for evolutionary systems minimally require two components: entities (things, species, products, individu- 
als, etc.) and rules how these entities can be used in combination with each other to produce new things (combination 
and substitution). For example there is a rule that the combination of hydrogen and oxygen can form water, or there 
is a rule that a male and female chicken can become parents of baby chicken. However there is no rule that a fish and 
a dog can be the parents of a chicken, nor is it possible to combine two blocks of U 235 into one big block of uranium. 
The set of all combination/substitution rules we keep in a 'rule table' denoted as a in the following. 

Suppose we characterize the relative frequency of entity i by Xi > and denote the rule whether entity i can be 
produced from entities j and k by ccyfc. If no such production rule exists, then a^t = and else Qyt > 0, when such 
a rule exists, a^t represents the rate with which j and k can produce i. For simplicity one can view a as Boolean, 
i.e. a £ {0, 1}. If aijk > we call (j, k) a creative pair producing (or 'pointing' to) i. The dynamics of the relative 
frequencies of the entities can be expressed by a network equation of the type 



The quadratic term suggests that i forms proportional to the abundance of j and k weighted by the rate o^fe. The 
$ term assures that the abundance of entities is proportional to the relative frequencies Xi, with Y] Xj = 1. 

Let us assume that the dimension of the system, denoted as d, i.e. the number of all possible entities i, is extremely 
large, but finite. If we know all i and otijk we could specify an initial condition x(t — 0) and solve the differential 
equation (fxj . However, neither d nor a is known in general. So how can one proceed? In previous work |14l I15j we 
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FIG. 2: From left to right: within some initial set of entities (blue dots) there exist several creative pairs, indicated by gray 
ovals. In the next timestep these pairs produce new entities (red dots). The new entities become part of the environment (red 
dots turn blue). With the new entities new productive pairs can be formed to again produce new entities. 



suggested the following combinatorial approach. Suppose we knew the number of productive pairs N + = . k Oiptijk) 
(where 6{x) = 1 for x > and 0(x) = otherwise) in the system and define the creative pair density r + = N + /d, 
which is the average number of productive pairs leading to one specific product. Suppose further that the topology 
of the productive catalytic network is random (maximum ignorance), then one can to use combinatorial arguments 
to map the catalytic network equation (JT|) onto the recurrence relation, |14j 

Ot+i = a t + Aa t 

Aot+i = r+ (1 - a t+1 ) (a 2 t+1 - a 2 ) , 

where at = ^2i&(xi(t))/d is the relative diversity of the system at time t, i.e. the fraction of possible entities with 
non-zero abundance. Here the initial condition is do and o_i = by definition; the rule density is r + . For a schematic 
iterative view of a dynamical increase of diversity see Fig. [2] It is remarkable that this equation does not explicitly 
depend on d any more and therefore is valid for arbitrarily large d. The increment Aa in Eq. ^ can be understood. 
r + a 2 is proportional to the number of productive pairs in a t . However, we have to subtract the productive pairs that 
have already been considered in the last time-step so that at time t only r + (a 2 — a 2 _ x ) new pairs have to be taken 
into account. Of these pairs only a fraction (1 — at) can be expected to produce something that does not already 
exist. The resulting update equation Eq. ([2| can be solved asymptotically by analytical means [TJ] and shows that 
the system has a phase transition in the final diversity doo. If r + > r^ rit , with a critical value r^ rit ~ 2, then there 
exists a critical value a cr i t {r + ) where there occurs a discontinuous jump in the final diversity from low to high final 
diversity. The transition is equivalent to the van der Waal's gas phase transition. This phase transition is shown in 
Fig. ([3| where the final diversity aoo is plotted over the initial diversity ao and the creative pair density r + . We have 
analyzed the stability of the fully populated state under external suppression of entities (initial defects) which also 
leads to scale-invariant update equations for the propagation of secondary defects. Here also a transition from almost 
linear behavior with respect to the initial defects to a break-down of the complete system can be demonstrated, |15j . 



II. AN EVOLUTION MODEL WITHOUT FITNESS 



The above creative catalytic network dynamics does not yet constitute a model for evolution. The selection mech- 
anism as an intrinsic part of an evolution system is missing. The task is how to introduce selection without changing 
the set-up and without reference to anthropomorphic a posteriori concepts like fitness, niches, selection pressure, etc. 

A straight forward way to achieve this, is to include suppressive pairs (j, k) with ay/. < such that the (j, k) 
suppresses the existence of i. Rule table otijk now encodes creation and suppression, r + remains the density of 
productive rules, and the density of suppressor rules, denoted by r - , is defined analogously as r~ = N~ /d, with 
N~ = J^i j k ®(~ a ijk)- A schematic view of the model is presented in Fig. El 

The influence of suppressors on the final diversity of the system can be estimated analytically. For instance, if a^ 
is the final diversity for the case without suppressors, then the asymptotic diversity of a system with a suppressor 
density of r~ can be derived on the basis of combinatorial arguments to be 7000, where 

This result has been shown to coincide reasonable well with simulations. Unfortunately it turns out that the diversity 
dynamics with suppressors is rather resistant to a more detailed analytical analysis and a numerical version of the 
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FIG. 3: Final diversity Ooo plotted over the initial diversity ao and the creative pair density r displaying a van der Waals 
like phase transition from low to high final diversity. 
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FIG. 4: Left: Within some initial population of entities (blue dots) we find creative pairs (gray ovals) as well as destructive 
pairs (yellow ovals). In the next timestep creative pairs produce new elements (red dots), suppressive pairs annihilate existing 
elements (crossed red dots). The resulting population with its potentially new pairs serve as a new initial population for the 
next timestep. 



model has to be implemented on the basis of the parameters r + , r and ao to learn more about the details of the 
dynamics of such systems. 



A. Implementation of the model 

We have implemented the most simple version of the above model. We consider only binary states Xi € {0, 1}, 
an element is present or absent, and a rule table with entries ot\jk € {—1,0, 1}. First we sample r + d random triples 
(i, j, k) and assign them a value — 1. On the set of remaining triples we do the same for r_ and assign a^k = — 1- 
All remaining entries in a are zero. Next we randomly sample an initial condition such that exactly a^d components 
in the initial species vector x(t — 0) are one; all others are zero. The elements Xi(t) are updated to time t + 1 in 
a random sequential order. For each node i we count the number n + and n~ of productive and suppressive pairs 
pointing to node i, respectively. If n + + nT > the node will be set to 1 with a probability p\ = n + /{n + + fj,n~), 
and to with probability po = 1 — p\ . Here fi is a parameter that specifies the relative strength of suppressive pairs 
over creative pairs. For example, if p — 1 and n + = n~ then the chance for an entity i to be created or deleted is 0.5. 
If neither a creative pair nor a destructive pair is pointing at node i, i.e. n + + n~ = then i does not get actively 
suppressed but also not actively produced. In this case i will continue to exist in the next time step with a probability 
A (sustain rate), in other words, entities that are not actively produced decay with a rate 1 — A. At this point we can 
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decide to protect the initial condition x(0) or the initial diversity ao, i.e. once the diversity a(t) has dropped down to 
the level of ao no further entity can be eliminated in the update. However, we found that both types of enforcing a 
minimum diversity lead to very similar characteristics of the dynamics. This protection of entities models e.g. sources 
of renewable goods which are neither subject to selection nor to decay. 

The dynamics of species in vector x(t) can be seen as a point in phase space. Two consecutive points in time have 
shifted by an 'angle' S(t) 



cos S(t) 



x(t)x(t-l) 

\x(t)Mt-i)\ 



(4) 



which can be regarded as a potential measure for the relative change of the population of entities and thus for the 
size of creative/destructive effects within this timestep. 
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FIG. 5: Two different runs with the same parameters for two different random seeds of the rule table a. One exhibits meta- 
bi-stable dynamics (left) the other low-diversity dynamics (center). In both cases the suppression factor is fi — 1.5 and the 
sustain rate A = 0.08. The initial condition :r(0) was protected. Statistics for the moments of the diversity dynamics for r + = 3 
and ao = 0.1 fixed is shown based on 25 runs with different random topologies per r~ (right). 



B. Results 



Most importantly our simple model is capable to produce meta-bi-stable dynamics, or punctuated equilibrium as 
called in biology. In Fig. [5] two runs for identical parameter settings (left and center) are shown. The two runs with 
identical dynamical parameters show meta-bi-stable dynamics in the one case and sub-critical dynamics in the other. 
The only difference appears in the random realization of the rule table a. The role of the parameters [i (relative 
suppressor strength) and A (sustain rate) are found to have only a minor influence on the qualitative behavior of the 
dynamics. In the right image of Fig. [5] the first moments of the diversity timeseries are shown as a function of r — , 
for all other parameters fixed. We see that the mean diversity decreases with increasing r~ just as we may expect 
fromEq. while the variance is below a critical r crit where the variance has a peak and then decays for r > r cr it . 
The simulation data we have analyzed indicates that this behavior is widely independent of the system size d, which 
was typically between 100 and 1000. We compared the characteristics of the diversity dynamics with direct numerical 
solutions of the catalytic-network equation Eq. (JT|) and found qualitatively comparable behavior on small systems. 
In case the time-evolution of the system-diversity is meta-bi-stable, the system seems to be critical. In Fig. ([6| the 
clearly non-Gaussian angular velocity distribution of 6 is shown. 



III. DISCUSSION 



We have proposed a simple model of evolution that is based on a set of three relevant parameters r + , r~ and ao- 
No anthropomorphic a posteriori concepts like fitness or ecological niches are necessary in the model, yet it is capable 
of explaining punctuated equilibrium in diversity timeseries, which is the most striking feature in experimental data, 
such as e.g. fossil records. The creative processes are modeled as a consequence of constructive interactions between 
pairs of entities while selection happens implicitly through suppressors, i.e. the existence of certain elements prevents 
(suppresses) the existence of others. A posteriori it becomes of course possible to interpret fitness and niches in terms 
of the interaction topology. However, the model indicates that fitness is maybe more a property of the collective 
state of a system than a property of the individual entities. When the state of the system is such that suppression is 



6 



1.6 



1.58 



timeseries 



10' 




angle 



10 



1000tg 



cr 
CD 



1.54, 



10 



5000 10000 




-8.02 0.02 

angular velocity 



FIG. 6: Diversity (left upper panel) and angular velocity (left lower panel) over time in a particular run displaying meta-bi- 
stable dynamics. The angular velocity distribution is shown in the right plot. 



minimized and creativity maximized the whole system thrives while in case too many suppressors are supported the 
whole system breaks down in a 'creative gale of destruction' |16j . 

Most remarkably, already the simple model we have implemented displays meta-bi-stable dynamics - which is 
exactly what we would expect to find - for a wide range of r~ , i.e. for r + > rj rit , ao > a cr i t (r + ). One may conclude 
from this that there is a non-vanishing probability to sample evolutionary systems with meta-bi-stable evolution by 
pure chance. 

The behavior of mean and variance indicate that when the system is prepared critically (ao > a cr it) then for small 
r~ the system will behave supra-critical and approaches a plateau where it stays and the variance is small. When 
r~ increases we find an onset of meta-bistable evolution. The probability for this behavior seems to be maximal 
at the critical value of r~ where the variance is maximal. For larger r~ the decreasing variance indicates both a 
decreasing probability to find meta-bi-stable dynamics and a decreasing amplitude of the high-diversity plateau. The 
meta-bi-stable scenarios seem to be critical as indicated by the non-Gaussian distribution of the angular speed S of 
Eq. @. 

Finally, since the parameters ao, r + , and r~ are insufficient to fully determine whether the dynamics resulting 
from some random topology a of the catalytic network will be meta-bi-stable, sub-critical, or supra-critical, this set 
of parameters is incomplete. The occurrence of meta-bi-stability or punctuated equilibrium has to be associated 
with topological properties of the catalytic network a. These properties, such as auto-catalytic cycles, have a non 
vanishing chance to be randomly sampled. It will be a fascinating task to relate the statistics of occurrence of such 
topological structures with the observation of punctuated equilibrium. 

This work was supported by Austrian Science Fund FWF Projects P17621 and P19132. We thank the orga- 
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